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Abstract 

Cell aggregation in unicellular organisms, induced by either cell non-sexual adhesion to yield floes and 
biofilm, or pheromone-driving sexual conjugation is of great significance in cellular stress response, medi- 
cine, and brewing industries. Most current literatures have focused on one form of cell aggregation termed 
flocculation and its major molecular determinants, the flocculation (FLO) family genes. Here, we imple- 
mented a map-based approach for dissecting the molecular basis of non-sexual cell aggregation in 
Saccharomyces cerevisiae. Genome-wide mapping has identified four major quantitative trait loci (QTL) 
underlying nature variation in the cell aggregation phenotype. High-resolution mapping following up 
with knockout and allele replacement experiments resolved the QTL into the underlying genes (AMN1 , 
RGA1, FL01 , and FLOS) or even into the causative nucleotide. Genetic variation in the QTL genes can 
explain up to 46% of phenotypic variation of this trait. Of these genes, AMN1 plays the leading role, differ- 
ing from the FLO family members, in regulating expression of cell clumping phenotype through inducing 
cell segregation defect. These findings provide novel insights into the molecular mechanism of how cell 
aggregation is regulated in budding yeast, and the data will be directly implicated to understand the mo- 
lecular basis and evolutionary implications of cell aggregation in other fungus species. 
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1 . Introduction 

Eukaryotic cells possess a remarkable capacity to 
aggregate together and form into a clump, which 
is crucial for the assembly of tissues and nervous 
systems, embryonic development, and cellular com- 
munication. Saccharomyces cerevisiae, the single-cell 
eukaryote, is also able to agglomerate cells into orga- 
nized structures such as floes, biofilm, and pseudohy- 
phae. These non-sexual clumps create protective 
shields for cells in inner parts to escape from abiotic/ 
biotic attacks,^ respond to nutrient starvation,^'^ and 
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have been widely used in industrial productions.'^ 
Thus, understanding the underlying mechanisms of 
the cellular behaviour has significant medical and evo- 
lutionary implications. However, most studies in the lit- 
erature have focused on flocculation, a non-sexual type 
of cell clumping, induced by interaction between cell 
wall glycoproteins encoded by highly homologous floc- 
culation (FLO) genes (FL01 , FLOS, FL09, and FLOIO).^'^ 
In particular, FLO 7 was reported to be a key gene whose 
expression led to strong flocculation' and was tran- 
scriptionally activated by FLOSJ'^ So far, these studies 
followed a reverse genetic approach to screen for the 
flocculins that bound specific sugar residues on 
the surface of other cells and led to formation of cell 
flocs.^''° However, the reverse approach has been 
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debated for its requirement of prior information on the 
underlying pathways leading to phenotypic expression 
of the traits/ ' ignorance of the interactions between a 
gene of interest and the genetic background, and its in- 
capacity to detect crucial polygenic interactions on 
phenotype.' ^"^"^ 

We present here a forward genetic approach to 
uncover the molecular basis of genetic variation in 
non-sexual cell aggregation in a natural population 
of budding yeast (S. cerevisiae). The approach inte- 
grates mapping of quantitative trait loci (QTL) with 
major effects on the cell aggregation trait, resolving 
the major QTL into the QTL genes, identifying causa- 
tive nucleotides within the QTL genes, and character- 
izing molecular functions of these QTL genes and 
their interactive effects on the trait. 

2. Materials and methods 

2. 7 . Yeast strains and growth conditions 

All yeast strains used in the present study are listed in 
Supplementary Table SI. YHIA, an isogenic haploid 
strain of the standard reference strain S288C, was 
crossed with YLl C, also a haploid strain described else- 
where.^^ Fi hybrids from crossing the parental lines 
were sporulated to generate 292 segregants that 
were equivalent to gametes, forming an F2 generation. 
We created two other isogenic haploid strains from the 
parental lines with changed mating types through 
modifying their HO genes. We used the standard 
yeast media and growth conditions in mating, sporula- 
tion, and tetrad dissection experiments in the present 
study.^*^ 

2.2. Scoring phenotype of cell aggregation 

The method used to score phenotype of cell aggre- 
gation in this study was modified from that proposed 
by Guo et al.^ In detail, any strain preserved under 
-70°C was recovered by plate streaking, and three 
single clones for the strain were obtained. Each 
clone was cultured in a 5-ml YPD liquid medium at 
30°C at 280 rpm for 22-24 h to make the cells 
enter the stationary phase of growth. The cultures 
were swirled briefly with a vortex shaker before a sedi- 
ment test. Phenotype of the tested strain was 
recorded as a sedimentary time in hours (T), i.e. the 
time taken for the tested cells to sink completely in 
the culture buffer in a test tube. On the basis of the 
sedimentary time records, we grouped phenotype of 
all tested strains into four categories when contrasted 
with the sedimentary records of the parental strains, 
Tyhia and Tylio which are illustrated in Fig. la. 
Phenotype of a tested strain was scored as one if T> 
Tyhia, two if Tyuc < T< Tyhia, three if 0<T< 
7'ylic, or four if 7= 0. In other words, phenotype of 



cell aggregation was quantified as a categorical char- 
acter with multiple thresholds in the present study. 

In order for clear visualization of the cell aggrega- 
tion phenotype, we present here microscope images 
for each of the four categorical phenotypes. To gener- 
ate these images, the tested cells were first stained 
with Gram's iodine and then observed and photomi- 
crographed using a Leica fluorescence microscope. 
All images illustrated in this paper were printed at 
the same magnification. 

2.3. Marker development and genotyping 

We first searched the genome sequence of yeast 
S. cerevisiae (http://www.yeastgenome.org/) for short 
tandem repeats (STR) within the genome by making 
use of the computer software Tandem Repeats 
Finder 3.21.^^ We tested 561 evenly distributed STR 
sequences from the search as candidate markers 
and confirmed 2 64 STR loci that exhibited poly- 
morphism between the two parental strains. These 
polymorphic markers were used as the basic marker 
set in the primary mapping experiment. Primer 
sequences for amplifying these STR markers were 
detailed in our previous study together with the ex- 
perimental protocols for collecting and analysing 
STR genotype data.'^ Beyond these STR markers, we 
developed a second set of single nucleotide poly- 
morphism (SNP) markers to be used in the second 
stage of fine scale mapping of cell aggregation QTL 
by directly sequencing the relevant DNA regions. 
This added other 2 5 SNP markers in the QTL 
regions inferred from the primary QTL mapping (see 
marker locations and primer sequences in 
Supplementary Table S2). 

2.4. Gene knockout, gene replacement, 
and nucleotide-specific mutagenesis 

We implemented the standard polymerase chain 
reaction (PCR)-based gene disruption techniques to 
knock out the candidate genes.' ^'^^ Single- and 
dual-knockout strains were constructed by replacing 
the target genes with the anti-hygromycin and anti- 
nourseothricin genes. The triple knockout strain was 
selected from offspring segregants from crossing of 
the dual-and single-knockout parental strains. All 
knockout strains obtained in the present study were 
confirmed by PGR experiments. The primer sequences 
for these PGRs are available upon request from the 
corresponding authors. 

To perform the allelic exchange experiments and nu- 
cleotide-directed mutagenesis, we replaced the allele 
in situ with the replacement modules comprising the 
counterpart alleles and the Zeocin resistance gene as 
a dominant marker (detailed in Supplementary Figs 
S7 and S8). 
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Figure 1. QTL fine mapping, (a) Cell aggregation phenotype of two parental strains YH1A and YLIC and their representative offspring 
segregants labelled 1 -4. The phenotype was assayed as sediment time in hours (left) and by microscope viewing (right) at 
stationary phase of cell growth. Red horizontal line in the left panel marks the top boundary of cell sediment, (b) Mapping of major 
cell aggregation QTLs on the yeast chromosome II and XV and candidate genes in the QTL regions. The physical map information of 
the candidate genes was from Saccharomyces Genome Database (www.yeastgenome.org). 



2.5. Preparation ofRNA samples and RT-qPCR analysis 
Cells from a tested strain were cultured in 50 ml 
YPD liquid medium until reaching ODeoo = 0.8. 
Total RNA was isolated using the hot phenol proto- 
col,^° purified with RNase-free DNase (Promega), 
and subjected to first-strand cDNA synthesis with 
Superscript™ III Reverse transcriptase (Invitrogen). 
One microlitre of the single-strand cDNA after 10- 
fold dilution was used as template for real-time quan- 
titative PGR (RT-qPCR) with SYBR-green (Toyobo) and 
ACT1 as the internal control. Every tested strain was 
independently cultured three times to gain three 
independent samples, and each of the samples was 
assayed for three repeated times of qPCR. The oligo- 
nucleotide primers for the three genes assayed were 
RGAlf: 5'-CGGTACAGTCACTGGAATCGAA-3', RGAl r: 



5'-ACGAATCAAACCTGGAGCAAA-3'?AMN 1 f: 5'-AGGC 
AAATACACACAGGTGGAA-3', AMNlr: 5'-TTCCCAAATA 
CCGGCATCA-3'; and ACTINf: 5'-TCGTTCCAATTTACG 
CTGGTT-3', ACTINr: 5'-CGGCCAAATCGATTCTCAA-3') 
for RGAl , AMN1 , and ACT1 , respectively. 

We first sequenced the FLOl gene of the two paren- 
tal strains, YLl C and YHl A, and found that there were 
only 44 bp in the FL01 gene, which distinguish FLOl 
from the paralogous genes FLOS and FL09 
(Supplementary Fig. S6). The sense primer of qPCR 
was designed across the 44-bp specific region, 
whereas the antisense primer on a non-specific 
region in 87-bp downstream. The respective specific 
sequences were 'GCCTCATCGCTATATGTTTTTGG' and 
'GCTGGTAAGCACGCCTCTGT'. The PGR product was 
sequenced and its specificity was confirmed. 
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2.6. Preparation of protein samples and western 
blotting assay 
We followed the PCR-based epitope tagging method 
to insert 6His into the C-terminal of the target protein 
as an epitope.^' Proteins were extracted using the 
protocol described by Adams et al.'^ Western blotting 
was performed using the standard laboratory proce- 
dures with monoclonal antibody specific for 6His 
(Univ. Co, China). 



2.7. Yeast two-hybrid analysis 

Yeast two-hybrid assay in the present study was 
carried out according to the manufacturer's manual 
(Clontech, CA USA). Three alleles of AMN1 (i.e. 
AMNl^"^^,AMN1^'-^'=, and AMNl^^^^^) were cloned 
independently into the vector pGADTZ (GAD), 
whereas TEA/17 was inserted into the vector pGBKTZ 
(GBK). These vector constructs were confirmed by se- 
quencing before being introduced into the reporter 
strains AH109 and Y187. These strains were then 
screened for positive co-transformation of the paired 
plasmids through growing them on the SC-Trp-Leu 
plates, synthetic complete medium lacking trypto- 
phan and leucine. Positive interactions between 
AMNl and TEM1 were detected as successful growth 
of the co-transformed strains on high-stringency SC- 
Trp-Leu-His plates (synthetic complete medium 
lacking tryptophan, leucine, and histidine) in the 
AH109 system. The positive interaction was also 
confirmed independently through the quantitative 
p-galactosidase assay in the Yl 87 system. 



2.8. Data analyses 

We implemented a Bayesian approach based on 
Gibbs sampling as proposed in Sorenson et al.^'* to es- 
timate additive genetic variance component and, in 
turn, heritability of the multiple categorical pheno- 
type of cell aggregation. In the statistical analysis, y,-, 
phenotype of the /'^ segregant, is determined by a 
simple liability model w/ = yLt + c// + e/, i.e. j//=y (1 < 
j < 4), if ty_i < w/ < tj. In the liability model, jj. is the 
population mean, a, is a random variable of additive 
genetic effect of the segregant on its phenotype, and 
£/ is the residual or environmental random variable. 
We assumed that the random variables, a, and e,, 
follow independently a normal distribution with 
mean zero and variance and a standard normal 
distribution, respectively. The threshold parameters 
satisfy -oo = to < ti < t2 < ^3 < t4 = °°- We imple- 
mented the algorithm developed by Dr Zhaobang 
Zeng at North Carolina State University for carrying 
out multiple-interval mapping of QTL affecting the 
multiple categorical phenotype of cell aggregation.^^ 
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3. Results 

3.1 . Characterizing cell aggregation of yeast 

We first established a reliable and feasible method 
to quantify the aggregation phenotype, which was es- 
sential for a quantitative genetic analysis and further 
functional analysis with the cellular behaviour charac- 
ter. The left panel in Fig. 1 a shows sediment status of 
cells from the clumpy and non-clumpy parental 
strains YLIC and YHl A and their four representative 
segregants at three sediment time points, 0, 1.0, 
and 2.5 h. Thus, cell aggregation was, here, scored as 
a categorical quantitative trait, with the two parental 
strains, YHl A and YLIC, having phenotype categories 
1 and 3, respectively and their offspring being scored 
as one of the four possible categories. In particular, 
cells of the segregant four sunk immediately in the 
clumping buffer (0 h) and the segregant was scored 
as category 4, a phenotype outside of the range of 
its two parents, showing transgressive segregation of 
the complex cell behaviour character. This method 
of categorizing the clumping character coordinates 
very well with that of the microscope views of the 
cell floes, indicating the feasibility and reliability of 
the phenotyping assay. 



3.2. Mapping and dissection of cell aggregation QTL 

From crossing the parental strains, YHl A and YLl C, 
that are highly divergent in cell aggregation pheno- 
type, we created a full-sibling population comprising 
292 segregants. We scored the segregants for their 
clumping phenotype and genotyped them at 264 
segregating simple sequence repeat (SSR) markers. 
Based on the phenotype and marker genotype data, 
we implemented the multiple-interval mapping 
method^^ for a preliminary genome-wide scan for 
QTL affecting the multiple categorical phenotype of 
cell aggregation. The analysis detected four significant 
QTLs that mapped on the yeast chromosomes I, II, V, 
and XV, respectively. To narrow down these QTL 
regions, we genotyped the segregants at 1, 9, 4, and 
1 1 further SNP markers in the 3 QTL regions, respect- 
ively and reran the multiple-interval mapping ana- 
lysis. Because the chromosome I and V QTLs contain 
FLOl and FLOS, respectively (Supplementary Fig. SI), 
two previously known genes involved in flocculation 
in S. cerevisiae,^'^^'^^ our focus here was on the 
other two QTLs on the yeast chromosomes II and XV. 
Figure 1 b shows the likelihood profiles for the two 
major QTLs together with physical maps of candidate 
genes within the QTL regions. It can be seen that the 
fine mapping enables to resolve the chromosome II 
QTL into six candidate genes and the chromosome 
XV QTL into seven candidates. 
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3.3. Major-effect genes for cell aggregation and 
causative nucleotide In chromosome II QTL 
Among the six candidate genes within the chromo- 
some II QTL (Fig. 1 b), three (RIB7, RPB5, and CNSl) are 
essential genes. We first knocked out each of the 
remaining three (SLUS, ICS2, and AMN1) in both 
the parental strains, YHl A and YLl C, and then evalu- 
ated the knockout strains for their clumping pheno- 
type. We found that phenotype of the clumpy 
parental strain, YLIC, was dramatically altered after 
\tsAMN1 gene was knocked out (Fig. 2a). To further 
consolidate the candidacy of AMN1 as the QTL gene, 
we carried out an allelic exchange experiment, in 
which the AMN1 allele of the YLl C strain was replaced 
with that of the YH 1 A strain and vice versa. The aggre- 
gation phenotype of YLl C was clearly reduced after its 
AMNl allele was replaced by that of YHl A (Fig. 2b). 
Moreover, the non-clumpy strain, YHl A, showed cell 
aggregation when its AMNl allele was replaced by 
that from YLIC (Fig. 2b). These findings provide the 
strong evidence supporting that AMNl is indeed the 
major clumping QTL gene. It should be noted that 
effects of replacing AMNl alleles are not reciprocally 
constant between the two parental strains. 



We sequenced the AMNl alleles from the parental 
and several other laboratory strains and found 
three polymorphic sites within the Amnl (Fig. 3a). 
Meanwhile, we assayed their expression at both 
transcriptional and translational levels through RT- 
qPCR and western blotting assays. Although the 
AMNl gene is expressed significantly differentially 
between the two parental strains at both the 
mRNA and protein levels (Fig. 3c), the polymorph- 
isms in the open reading frame (ORF) are sufficient 
to lead to phenotypic variation according to the 
results of allelic exchange (Fig. 2b). 

We further used site-directed mutagenesis to iden- 
tify the causative quantitative trait nucleotide(s) in 
the QTL gene AMNl by replacing the AMNl gene of 
the YLl C strain with three different DNA segments of 
21 50 bp in length from the strain YHl A, accordingly. 
The segments covered the whole coding sequence of 
the AMNl gene with the YHIA origin, but differed 
only at each of the three mutant nucleotides. We 
assayed these genetically modified strains for their 
clumping phenotype and found that only the strain 
that carried the V368D substitution in the AMNl 
gene could recover the phenotype of YLIC 
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Figure 2. Cell aggregation phenotype of engineered parent strains across QTL genes. Cell aggregation phenotype that was scored by the 
sedimentary assay (the left panel) and microscope view (the right panel) for the two parental strains and their genetically modified 
strains through knock out or allele replacement. Red horizontal line in the left panel marks the top boundary of cell sediment, (a) 
The three QTL genes (AMNl , FLOS, and RGA1) were knocked out in the clumpy parent YLIC. 'A' represents deletion, i.g. 'amnl A' for 
deletion AMNl. (b) The alleles of three QTL genes were replaced in situ by their counterpart alleles between the parents. The 
superscripts 'A' and 'C refer to the YHlA-and YLl C-derived allele, respectively, i.g. AMNl^ represents the allele of AMNl derived 
from YHIA. 
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Figure 3. Genetic variation of QTL genes between two parental strains. Partial amino acid sequences of the cell aggregation QTL genes 
AMN1 (a) and RCA1 (b) with amino acid substitutions highlighted. The amino acid sequence in other yeast species is shown under 
the polymorphic residues when available: Saccharomyces paradoxus, Saccharomyces mikatae, Saccharomyces bayanus, Saccharomyces 
kudriavzevii, Saccliammyces kluyveri, Candida giabrata, Picliia angusta, Yarrowia lipolytica, and Kluyveromyces lactis. (c) Comparison in 
mRNA and protein expression of AA/1N7 between the two parental strains, YHIA and YLIC. 



(Supplementary Fig. S2a), strongly supporting the mu- 
tation as the causative quantitative trait nucleotide. 



3.4. V368D variation determined the cell aggregation 
phenotype through the MEN pathway 
It has been reported that/4A4iV7 plays a key role as a 
daughter-specific switch that helps cells exit from the 
mitotic exit and reset the yeast cell cycle through 
binding to Teml, a Ras-related GTPase. The TEM1 
gene activates a signal transduction pathway, mitotic 
exit network (MEN) to allow mitotic exit.^^ Based on 
this observation, we performed the yeast two-hybrid 
experiment in a standard strain AH109 and 
demonstrated that only Amnl^'-^'^ and Amnl^^*^*'^ 
can bind to Teml effectively (Fig. 4a). The quantitative 
p-galactosidase analysis yielded the results that were 
highly consistent with the two-hybrid analysis 
(Supplementary Fig. S3). Furthermore, we compared 
expression of the downstream genes DSE1 , DSE2, and 
SCWl 1 between YLl C and YH 1 A, using the Affymetrix 



microarray hybridization data described previously in 
Wang et al.^^ These genes were reported to underlie 
the cell wall degradation and daughter cell separ- 
ation, ^° and the expressions could be activated by 
Ace2, the transcriptional factor regulated by the MEN 
pathway.^ ^ Our study shows a significantly lowered 
level of expression of these genes in YLIC than in 
YHIA (Fig. 4b), suggesting that the transcription of 
DSEl, DSE2, and SCW) 1 would be inhibited in the 
clumpy parent strain, YLl C. Then, we constructed two 
strains both derived from YLl C, which differed only at 
the 368th amino acid residue of Amnl . The expression 
of these downstream genes DSEl , DSE2, and SCWl 1 
exhibited significant difference between the strains 
(Fig. 4c). These observations support the hypothesis 
that the V368D substitution in the Amnl of the 
clumpy parent YLIC enhanced its binding to Teml 
and, in turn, blocked the MEN pathway, reduced tran- 
scription of the downstream genes DSE1 , DSE2, and 
SCW1 1 , and finally resulted in inhibition of daughter 
cells' separation and formation of clumps (Fig. 4d). 
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Figure 4. Functional verification oi AMNI underlying yeast cell aggregation, (a) Yeast two-hybrid assay on interaction between the AMNI 
and TEM 1 genes, (b) Expressional assay of the three downstream genes, DSE1 , DSE2, and SCW1 ? , from the two parental strains, (c) The 
microscopic view and the expression assay of DSE1 , DSE2, and SCW1 1 between two strains derived from YLIC, only differing in the 
368th amino acid residue of Amnl. (d) A hypothetical model on how the single nucleotide substitution (V368D) of Amni affects 
the MEN signal transduction pathway, which, in turn, determines daughter cell separation and formation of cell clumps. 



3.5. RGAl as the chromosome XV clumping QTL gene 
To dissect the cliromosome XV QTL, we screened for 
the candidate(s) that responded to knockout and gene 
replacement treatments. Of the seven ORFs within the 
QTL, only when the RGA1 gene of the clumpy strain 
YLIC was silenced through knockout, its clumping 
phenotype was visibly altered (Fig. 2a). In addition, the 
phenotype was also changed significantly when its 
RGAl allele was replaced with that from YHl A (Fig. 2b). 
These observations strongly support the candidacy of 
RGAl as another major QTL gene. We compared the 
Rgal sequence between the two parental strains and 
detected four polymorphic sites (Fig. 3b). We failed to 
find a single nucleotide substitution that could explain 
the observed change in the phenotype, although we 



found that the third polymorphic site in the gene had a 
significant causative effect on invasive growth of the 
yeast cells (Supplementary Fig. S4). Furthermore, we 
compared, but did not detect any significant change in 
expression of the RGAl alleles between the two parental 
strains at either the transcriptional or translational level 
(Supplementary Fig. S5). 

Rgal, a member of the Cdc42 GTPase-activating 
protein family, has been reported as an apical and 
isotropic switch in controlling the budding pattern of 
yeast cells by influencing hydrolytic status.^^'^^ In the 
presence of normally functional RGAl, Cdc42-GTP 
hydrolyzes and becomes Cdc42-GDP, leading to axial 
budding of cells that shows round cell morphology. 
The cells show bipolar budding and elongated 
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RGAIwt&AMNIwt RGA1 vat & amnIA AMN1 vnt & rgalA amnlA&rgalA 

Figure 5. Diagrammatic illustration of sequential effects of the two underlying QTL genes, AMN? and RGA1 . The two genes affect yeast cell 
aggregation through controlling cell separation and budding pattern, respectively. The black line indicates the effect of AMN1 on the 
aggregation trait and the blue the effect of RGA1 on the trait. The solid (or dotted) lines specify the genetic effects of the QTL genes 
in a normal (or dysfunctional) form. 



morphology when their RGA1 is silenced. However, it 
should be stressed that effect of the RGA1 on cell 
budding and morphology would be only partial 
because there exist other family members such as 
RGA2 and BEM3. Their presence may compensate for 
defects of the silent RGA1 .^'^ In light of the observation, 
we examined joint effects of the two verified QTL genes, 
RGA1 and AMN1 , on cell aggregation. Figure 5 illus- 
trates sequential influences of the two major clumping 
QTLgenes, andAMNl ,on budding pattern, cellu- 
lar morphology, and cell separation of the clumpy 
strain YLl C. When the AMN1 gene alone was knocked 
out whereas RGA1 remained normal, the strain 
showed cell separation and largely round shape. 
When only RGA1 was silenced whereas AMN1 func- 
tioned normally,daughtercellsof the strain were inhib- 
ited to separate from mother cells. Some of the cells 
then displayed bipolar budding and apical attachment 
between mother-daughter cells in elongated shape, 
resulting in smaller clumps, and the presence of 
strings comprises a few apically linked cells. Whenever 
both the genes were knocked out in the clumpy 
strain, cells were promoted to bud in bipolarization, 
and inhibition of cell separation was released, leading 
to formation of bar-like cell clumps. These explain 
why single-knockout strain of AMNl has a more 
defected phenotype than dual-knockout strain. 

3.6. FLOl, FLOS, and transgressive segregation 

Our QTL analysis detected two significant QTL 
regions containing the two genes relevant to floccula- 
tion, FLOl and FLOS. To test their candidacy as major 
cell aggregation genes, we first compared sequence of 



the two genes between the two parental strains, YHl A 
and YL1C, and found that the FLOl allele in the 
clumpy strain YLIC had eight fewer tandem repeats 
of an intragenic domain than those in the non- 
clumpy parent YHl A. This disagrees with the previous 
observations that the degree of cell flocculation was 
found to be proportional to the number of tandem 
repeats of the intragenic domain structure.' '^^ 
Comparison of genie sequence of FLOS between the 
two parental strains revealed a nonsense substitution 
at the 42 5th nucleotide in YHl A, consistent with the 
previous study.^^ The mutation creates a stop codon 
(TAG) and results in a truncated protein. The defective 
FLOS in YHl A cannot activate expression of its FLOl , 
thus explaining why the FLOl gene did not lead to 
stronger flocculation as previously expected, although 
the non-clumpy strain carried the allele enhancing 
flocculation phenotype. 

We knocked out the FLOl and FLOS genes individual- 
ly in YHl A and YLl C and observed that the clumping 
phenotype of YLl C was markedly altered after either 
FLOl or FLOS gene was silenced (Supplementary Fig. 
S2b). Moreover, we replaced the FLOS allele in YLIC 
with the YHlA-derived counterpart and observed a 
similar phenotype to the FLOl or FLOS deletion strain 
(Supplementary Fig. S2b). The data showed that 
geneticvariationof both the FLO genes contributed sig- 
nificantly to the clumpy phenotype. We attempted but 
failed to replace the FLOl gene in the same way as 
reported above. This can be attributed to the difficul- 
ties arising from the location of the FLOl gene at the 
chromosomal end with rich heterochromatin, thus 
keeping the gene in a protected status from genetic 
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Figure 6. Genetic interactions of the QTL genes. 'A' and 'C in (a) 
and (b) refer to as the YH1A- and YLIC-derived alleles, 
respectively, (a) Expression of FLOl and FLOS in two parental 
strains and their two groups of offspring segregants. The 
genotype of each strain at the four QTL genes is shown below, 
accordingly, (b) Distribution of segregants with different cell 
aggregation phenotypes (1 -4) within each of genotypes at the 
four QTL genes. Each column represents a group of individuals 
shared with the same genotype at the four QTL genes, (c) Cell 
aggregation phenotype of the two parental strains and their 
two selected offspring segregants (S2 and S5) after different 
treatments of ethylenediaminetetraacetic acid and CaCl2. 
Panel 1 (control) corresponds to where cells of the tested 



modification. Secondly, the FLO/ gene has an extreme- 
ly high levelof homology with its family members FLOS 
and FL09, making it extremely difficult to ensure speci- 
ficity in replacing the target gene. 

Although it was highly technically challenging to 
profile expression of the FL01 gene due to its extreme- 
ly high level of sequence homology with other FLO 
family members FLOS and FL09, we succeeded in 
assaying expression of the FLO gene in the two paren- 
tal strains through an F/.0/ -specific RT-qPCR experi- 
ment (see Supplementary Fig. S6 for detail). We 
analysed the FLO 7 and FLOS expression levels of the 
parental strains and their eight offspring individuals 
with two different genotypes at the QTL genes and 
varying phenotypes. FLOS expression showed no dif- 
ference in the tested strains. Moreover, a highly 
expressed FLOl gene was common for the strong 
cell clumping phenotype (the phenotype categories 
3 and 4) but was certainly not an indicator of dumpi- 
ness because the gene was also observed to be highly 
expressed in the non-clumpy strains (those with 
phenotype category 1 ) (Fig. 6a). 

Intriguingly, 30 extremely clumpy segregants (T= 0) 
exhibited more strongly clumpy than parental strains, 
which is termed as transgressive segregation. To 
explore how the QTL genes affect the transgressive seg- 
regation, we first grouped the 2 92 segregants into 4 
panels of 1 6 groups according to their allelic origin or 
joint genotype at the 4 QTL genes. Figure 6b illustrates 
distributions of the segregants with different pheno- 
types (1 -4) within each of the four-gene genotypes 
and shows that the segregants receiving the four QTL 
genes from the clumpy parent, YLIC, have a much 
lower probability of expressing extreme clumpy pheno- 
type (phenotypic category 4) than those with a YHIA- 
derived allele of FL0 1 , indicating a dispersion distribu- 
tion of cell aggregation causing alleles at the QTL 
genes between the two yeast parental strains and 
thus explaining the transgressive segregation observed 
in the segregant population. Moreover, the phenotypic 
effect of the FLOl allele was highly background de- 
pendent. This FLOl allele would not cause the clump- 
ing phenotype in the absence of the other three QTL 
genes from the clumpy parent YLIC (group 7 vs. 
group 3 in Fig. 6b), but plays the role of clumping 
enhancer. 



strains were scored without being treated, panel 2 (0.1 M 
ethylenediaminetetraacetic acid) to where the cells were 
washed twice with 0.1 M ethylenediaminetetraacetic acid and 
other two times with sterile water before being tested, and 
panel 3 (0.1% CaCl2) to where the cells were treated in the 
same way as those in panel 2, but tested in the flocculation 
buffer with 0.1% CaCl2 added. Red horizontal line represents 
the top boundary of cell sediment. 
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Given the fact that the flocculins leading to cell floc- 
culation share a common three-domain structure and 
require Ca^+ ions to stabilize the structure,^ we 
explored the effect of Ca^"*" ions on clumping pheno- 
types of the two parental strains and their two segre- 
gants (S2 and S5) that showed the transgressive 
phenotype. Figure 6c shows that the cell aggregation 
phenotype of the strains S2 and S5 was reduced to 
the extent of YLIC when Ca^"*" ions were chelated 
with the ethylenediaminetetraacetic acid treatment 
(panel 2), but recovered when the ions were added 
as CaCia in the clumping buffer, suggesting that the 
transgressive phenotype can be attributed to FLO 
gene family. This assumption was confirmed by 
the further knockout experiments in which the 
transgressive phenotype was disrupted when either 
FL01 or FLOS was silenced in S2 and S5 strains 
(Supplementary Fig. S2b). Strikingly, the treatment 
did not result in altered phenotype of the clumpy par- 
ental strain (YLl C). This together with the fact that this 
parent carried the non-clumpy FUJI allele indicates 
that the phenotypic response to the Ca^+ treatment 
actually reflects response of the Ca^^-dependent cell 
aggregation induced by the cell wall protein, which 
differs from that mediated hyAMNI. 

We assessed relative contribution of the QTL genes to 
phenotypic variation in cell aggregation and found 
that genetic variation in AMN1 explained 25% of the 
phenotypic variation, whereas the other three QTL 
genes FLOS, RGA1 , and FL01 , respectively, explained 
1 7, 5, and 1 1 % of the variation. Variation between ag- 
gregative genotypes at all the four genes explained 
46% of the phenotypic variation. To further explore 
the joint and interactive effects of the genes, we gener- 
ated three dual-knockout strains (YLl C-amnl ASifloSA, 
YUC-amnl A&rgal A, and ^UC-floSA&rgal A) and 
one triple-deletion strain (YLlC-omn? A&//oSA& 
rgal A). Phenotypes of these engineered strains are 
shown together with other single QTL gene knockout 
strains and the wild-type strain in Fig. 2. Although the 
degree of cell aggregation of the single-gene knockout 
strains was further reduced when they were knocked 
out for additional one or two of the QTL genes, it is 
clear that different configurations of the knockout 
QTL genes have led to different degrees of dumpiness. 
This, together with the observation illustrated in 
Fig. 6b, reveals that the QTL genes show clearly epistat- 
ic effects on cell aggregation. 



4. Discussion 

The genetic architecture underlying a quantitative 
trait is determined by the number of genes that con- 
tribute to their additive and interactive effects on vari- 
ation of the trait phenotype.^^ Although many 



genomes have been fully sequenced and the 
number of genes with annotated function increases 
rapidly in the era of functional genomics, it still 
remains a great challenge to reconstruct the architec- 
ture with a full insight of the genetic parameters. 
Budding yeast, the single-cell eukaryote, but with 
probably the richest structural and functional infor- 
mation of the genome, provides an ideal model for 
dissecting the molecular basis of genetic variation in 
typical quantitative traits such as high-temperature 
growth^ ^ and sporulation efficiency^^ and in genetic 
control of transcriptome abundance.^^ The present 
study extends these successful examples by dissecting 
multiple-threshold phenotypic variation of yeast cell 
aggregation into a genie level. 

Several genes have been reported previously in the 
literature to control clumping phenotype variation 
in budding yeast. Of these, the most prominent are 
the FLO gene family responsible for many cellular ad- 
hesion phenotypes, including flocculation.^° In sharp 
contrast, the present study reveals the two novel 
genes, AMN1 and RGA1 , in addition to previously 
identified two genes, FLOl and FLOS, that are signifi- 
cant determinants for the phenotype. Success of the 
present study in resolving the mapped QTLs into the 
underlying QTL genes can be attributed to that the 
major QTLs have been mapped at the high resolution 
by which direct gene targeting becomes possible. We 
noted that the simulation study by Li et al. predicted 
a much larger confidence interval for mapping cat- 
egorical traits from a simulated backcross population 
with 200 individuals genotyped at genetic markers of 
every 10 cM. When compared with the simulation 
study, the present analysis has employed a substan- 
tially larger sample size (2 92 segregants) and higher 
density of markers in the QTL regions. These together 
with the fact that the budding yeast has a highly 
recombiogenic and compact genome paved the 
basis for the mapping resolution we have achieved. 

The combined effect of these QTL genes explains 
46% of the overall phenotypic variation, leaving the 
remaining 54% of the variation to be residual. The re- 
sidual variation can be explained as two parts as well 
as their interaction, the genetic component whose 
underlying genetic effects are too small to be 
detected by the experiment, and the component con- 
tributed by environmental modification to the quan- 
titative trait. In addition, we demonstrated that the 
QTL genes exhibited clear epistatic effects on the ag- 
gregation trait. These show that the underlying mech- 
anism of yeast cell aggregation is more complicated 
than expected in the previous literature. 

Of these QTL genes, AMN 1 plays the leading role in 
regulating cell aggregation. We provide strong and 
direct experimental evidence that supports this is 
through the genes modulating the mitotic exit 
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network in S. cerevisiae}^ We have identified the 
AMN1 as a major QTL gene of cell aggregation 
through a genome-wide QTL scan in a natural popula- 
tion and provided here the direct experimental evi- 
dence for the V368D substitution as the causative 
genetic variant. Our results also strongly indicate the 
putative domain of AMN1 , and the functional signifi- 
cance of V368D, which has been recently predicted as 
a superfamily being highly conserved (e-value of 
6.38e-96) across species.^^ On the other hand, the 
RGA1 gene, a member of the GTPase-activating 
family, is first identified in the present study as a 
major gene affecting cellular clumping in budding 
yeast. This gene is required for yeast cells to switch 
from the apical to isotropic growth phase.'^" The 
present study creates an opportunity to explore the 
joint effect of this cell aggregation QTL gene, a 
budding pattern controller, and another QTL gene 
AMN1 , also a mitotic exit regulator. Genetic modifica- 
tion to these genes through gene knockouts disperses 
the well-known grape-like strings of cells (Fig. 5). 

In addition to the above two major genetic factors, 
FL01 and FLOS were also detected as the QTL genes 
through the present forward genetic analysis. Highly 
expressed FL01 , a cell surface adhesion regulator, has 
been widely recognized as a biomarker for cell floccu- 
lation in many candidate-based analysis. However, we 
present here the role of the gene, as merely one of 
the building elements in the genetic architecture 
underlying the complex phenotype, and show that 
FL01 is highly background dependent in determining 
the phenotype of its carrier. Unlike what has been 
widely accepted in the literature of yeast cell floccula- 
tion, expressed FL01 does not necessarily confer a 
clumping phenotype to its carriers. We demonstrate 
that the FL01 allele from a non-clumpy parental 
strain, in the presence of the other three QTL genes 
from a clumpy parental strain, creates the genotype 
for the clumping phenotype that substantially sur- 
passes performance of both the parents, i.e. transgres- 
sive segregation of the QTL genes. Taking all these 
together, the QTL genes unveiled in the present study 
have been shown to be involved in multiple pathways 
that govern the cellular development (RGA1), the be- 
haviour of daughter cells (AMNl), and the cell- 
surface adherence property {FL01 and FLOS). In con- 
trast to the studies in which biological functions and 
their phenotypic significance of these QTL genes 
were examined either independently or jointly, the 
present study unveils the QTL genes as major determi- 
nants of the clumping phenotype in a systematic way. 
This clearly indicates the power and appropriateness 
of the forward genetic approach employed in the 
present study in uncovering the genetic basis of an ag- 
gregate polygenic trait such as cellular clumping that 
involves multiple constituent components. 



The evolutionary significance of cellular clumping 
has recently been highlighted in the single-cell 
species because the clumping phenotype can protect 
inner individuals in cell clumps from biotic or/and 
abiotic stresses such as alcohol, anti-microbials, and 
physical attacks. The FL01 gene is recognized as a 
'green beard gene' because its induced expression in 
a non-flocculent laboratory strain, S2 SBC, can restore 
its cells to flocculate, and the cellular floes, thus, gener- 
ated provide a cooperative shield to protect from stres- 
ses in the living environment.' Our data demonstrate 
that expression of FL01 may not be the biomarker for 
a clumping phenotype, although an unexpressed 
FL01 allele from the non-clumpy strain, which is iso- 
genic to S228C, may enhance clumping of its offspring 
strains carrying other major clumping genes. In the 
polygenic system uncovered here, /4A4i\/7 is found to 
be a more important genetic determinant than FL01 
in controlling expression of the clumping phenotype, 
and this endogenously expressed gene may make a 
more appropriate target for investigating the molecu- 
lar mechanism underlying the 'green beard' effect. 
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